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1.  Introduction  and  Summary 

Economists  have  used  the  traditional  information  theory  measures 
such  as  "entropy"  and  "redundancy"  to  measure  the  extent  to  which  busi¬ 
ness  is  concentrated  in  the  control  of  giant  firms.  Hart  (1971)  has 
compared  these  measures  with  the  classical  statistical  measures  of  dis¬ 
persion,  and  other  measures  of  business  concentration  derived  from  the 
"Lorenz  curve."  More  recently,  the  sample  entropy  has  been  used  to 
develop  a  test  of  goodness  of  fit  for  normality  (Vasicek,  1976).  The 
redundancy  has  also  been  used  as  a  measure  for  ordering  any  two  distri¬ 
butions  within  the  class  of  "star  ordered"*  distributions  (Chandra  and 
Singpurwalla,  1980).  This  ordering  is  useful  for  discussing  the  unbias¬ 
edness  of  certain  goodness  of  fit  tests;  see,  for  example,  Sections  3.1 
and  3.2. 

In  this  paper,  we  consider  an  estimator  of  the  redundancy,  and 
obtain  its  sampling  properties.  We  show  that  the  estimator  is  almost 


*Let  Fj  and  be  two  continuous  distributions  on  [O,00)  ; 

then  F.  is  said  to  be  star  ( anti-star )  ordered  with  respect  to  F.  , 

-l1  _i 

if  F  F  (x)/x  is  nondecreasing  (nonincreasing)  in  x  for  0£x<F  (1) 
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sure  consistent,  and  is  asymptotically  normal.  We  use  these  properties 
to  propose  a  scale-free  test  for  exponentiality  based  on  the  sample  re¬ 
dundancy,  and  obtain  the  asymptotic  relative  efficiency  of  this  test 
compared  to  tests  based  on  the  maximum  likelihood  estimates.  Percentage 
points  of  the  test  statistic  for  finite  samples  are  obtained  via  a  Monte 
Carlo  experiment.  The  power  of  this  test  against  various  alternatives 
is  compared  with  a  recently  proposed  test  for  exponentiality  based  on 
the  Gini  index  (or  equivalently,  the  total  time  on  test  statistic)  using 
a  Monte  Carlo  experiment. 

Our  conclusion  is  that  a  test  for  exponentiality  based  on  the 
sample  redundancy  is  unbiased  in  the  sense  of  Lehman  and  Scheffe  fcf. 
Ferguson  (1967,  p.  224)],  and  that  it  performs  as  well  as  a  test  for  ex¬ 
ponentiality  ba3ed  on  the  Gini  index  for  the  gamma,  the  Weibull,  the 
uniform,  and  the  Pareto  alternatives.  These  distributions  are  either 
star  or  anti-star  ordered  with  respect  to  the  exponential  distribution. 
The  same  conclusion  also  holds  for  the  lognormal  alternative,  when  the 
parameters  of  this  distribution  are  so  chosen  that  it  is  either  star  or 
anti-star  ordered  with  respect  to  the  exponential  distribution.  When 
the  lognormal  alternative  fails  to  be  either  of  the  above,  then  neither 
the  test  based  on  the  sample  redundancy  nor  the  test  based  on  the  Gini 
index  should  be  used;  this  is  because  it  is  not  possible  to  claim  un¬ 
biasedness  of  these  tests.  However,  should  we  decide  to  go  ahead  and 
use  these  tests,  then  we  find  that  the  test  based  on  redundancy  appears 
to  have  a  slight  advantage  over  that  based  on  the  Gini  index,  with  re¬ 
spect  to  power:  both  the  tests  have  low  power.  Finally,  the  asymptotic 
relative  efficiency  of  the  test  based  on  redundancy  for  the  gamma  and 
the  Weibull  alternatives  is  also  comparable  to  that  based  on  the  Gini 
index. 


We  have  no  compelling  reason,  other  than  ease  of  computation  and 
the  slight  advantage  of  power,  to  recommend  a  test  for  exponentiality 
based  on  redundancy  over  other  available  tests.  The  main  purpose  of 
this  paper,  then,  is  to  point  out  the  possibility  of  using  the  sample 
redundancy  as  another  means  for  testing  for  exponentiality,  and  to 
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describe  the  circumstances  under  which  the  tests  based  on  the  redundancy 
and  the  Gini  index  should  not  be  used. 

2.  Redundancy  and  Its  Estimation 

For  a  nonnegative  random  variable  X  with  a  distribution  func¬ 
tion  F  and  probability  density  function  f  ,  the  redundancy  is 

defined  [see,  for  example,  Thiel  (1967,  p.  96)]  as 

*F  ■  E(s  j)  ■ 

where  p  =  E(X)  is  assumed  to  be  finite. 

Let  X, ,...,X  be  a  random  sample  from  F  ,  and  let  Y.  * 

I  n  r  l 

Xi^i  Xi  ’  •  The  sampT-e  entropy  of  Y_,...,Y  is  defined 

by  Hart  (1971)  as 

n 

H(Y)  -  -  l  Y  log  Y  .  (2.2) 

i 

Note  that  H(Y)  attains  its  maximum  value  log  n  when  Y^  =  1/n  ,  for 

1*1,..., n  .  Vasicek  (1976)  proposes  a  test  for  normality  based  on  the 
sample  entropy  of  f  . 

The  difference  between  the  sample  entropy  H(Y)  and  its  maximum 
value  log  n  is  called  the  sample  redundancy,  .  Thus, 

Rn  *  log  n  -  H(Y) 

n  n 

=  l  Y  log(nY  )  ,  since  l  Y  *  1  . 
i 

Some  further  simplifications  lead  us  to  write 

1  n 

R  *  —  l  x.  log  X.  -  log  X  ,  (2.3) 

nX  i  1 

where  X  ■  ]F"  X^/n  . 
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In  Theorem  2.1  below,  we  shall  show  that  R^  — — ->  R^,  ,  and  that 
Is  asymptotically  normally  distributed.  Thus  R^  is  an  almost  sure 
consistent  estimator  of  R_  . 


2.1  Sampling  Properties  of  R 


In  what  follows  the  following  abbreviations  will  be  used. 

If  {X  }  is  a  sequence  of  random  variables,  and  if  {b  }  is  any 
n  n 

other  sequence,  then  the  notation  "X  =0  (b  )"  denotes  the  fact  that 

n  p  n 

(X  /b  )  ^  0  ,  where  "  £  "  denotes  convergence  in  probability.  The  nota- 
n  n 

D  2 

tion  "  -*■  N(y,0  )  "  denotes  convergence  in  distribution  to  a  normally 

2 

distributed  random  variable  with  mean  y  and  variance  o 

Let  E(XlogX)  =  a  ,  where  a  is  finite,  and  let  E(XlogX)2  <  00 

2  2  aX  2 

also,  let  a  ( F )  «  (1/y  )E(XlogX  -  -jp  -  X  +  y)  •  Then  as  n  -*>  ®  , 

Theorem  2.1:  /n  (R^-R^,)  5  n(o,02(F))  . 

Proof:  We  shall  first  show  that  R  3 ,,S * >  R^  .  Write  S  = 
-  n  T  n 

^  X^  log  X^  ;  then,  using  the  strong  law  of  large  numbers,  we  can 

write 


_  a  ,  ,  ,  Sn  ,  (X.\  a 

“n  -  v +  1o8U  ■  T '  l08w "  c 


<V“+“)  -(^*---)  ~v 

[<Sn-a)  +  ot-2  {JE-y^u-1  -  u’bx-M)  -  | 
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"'1  I*!1"**! 

1 


X.  + 


“) 


where 

Z.  =  y^fx.logX.  -  ~  X±  -  X.  +  u)  . 

The  result  follows  since  ES  =  a  ,  and  X.  log  X.  are  independent  and 

n  11 

identically  distributed.  To  prove  the  statement  of  the  theorem,  we  note 
that  the  Z^'s  are  independent  and  identically  distributed  with 


EZ.  =0  , 

l 

and 

2  2 

EZJ  =  a  (F)  , 


and  invoke  the  central  limit  theorem. 


// 


3.  A  Test  for  Exponent iality  Based  on  R 
_ n 

Since  R  is  scale-free,  we  can  use  the  result  of  Theorem  2.1  to 
n 

obtain  a  test  for  exponentiality  by  taking  the  underlying  distribution, 
say  G  ,  to  be  a  unit  exponential;  that  is,  G(x)  =  1-e  x  .  Using  the 
fact  that  under  G  ,  E(XlogX)  =  r^(2)  =  .4227  ,  and  that  E(X2  log  X)  = 
r1  (3)  =  1.8455  ,  we  can  show  that  a2(G)  =  E(XlogX  -  X  E  X  log  X  -  X+l)2  = 

1  — x 

.2898664  ;  F  (•)  is  the  digamma  function.  Thus,  when  G(x)  =  1-e  , 

✓n(R  -R_)  5  N(0,. 2898664)  , 
n  G 

and  so  the  above  result  can  be  used  to  test  for  exponentiality  when  n 
is  large,  using  standard  procedures  for  testing  hypotheses. 

The  exact  distribution  of  R  when  the  underlying  distribution 

n 

is  G  is  not  known.  Consequently,  the  quantiles  of  ,  for  1  <  n  < 
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30  ,  are  obtained  through  a  Monte  Carlo  simulation  involving  10,000  ran¬ 
dom  samples  each  of  size  n  ,  generated  from  G  using  the  subroutine 
GGAMR  of  the  International  Mathematical  and  Statistical  Library  program 
package.  These  quantiles  are  given  in  Table  3.1. 

3.1  Power  of  the  Test  for  Exponentiality 
Based  on  R 

n 

To  study  the  power  of  a  test  for  exponentiality  based  on 

against  several  alternatives,  and  to  compare  the  performance  of  this 
test  versus  other  competing  scale-free  tests,  we  have  to  again  resort  to 
a  Monte  Carlo  simulation.  For  convenience,  we  choose  n=20  so  that  our 
test  statistic  becomes  R^g  ,  and  we  take  1000  replications  of  the  test. 

The  significance  level  is  taken  to  be  .05,  and  the  alternatives  consid¬ 
ered  are  the  Weibull,  the  uniform,  the  Pareto,  the  gamma,  and  the  log¬ 
normal.  Depending  upon  the  nature  of  the  alternative,  either  the  one¬ 
sided  or  both  the  one-sided  and  the  two-sided  tests  are  studied.  One¬ 
sided  tests  are  used  whenever  the  alternative  F  is  star  (anti-star) 
ordered  with  respect  to  the  exponential  distribution  G  ,  because  then, 
Rj.  <(>)  Rg  [Chandra  and  Singpurwalla  (1980)].  Also,  whenever 

R^  <(>)  R„  ,  the  test  based  on  R  is  unbiased  against  the  alternative 
F  —  "•  G  n 

F  ,  in  the  sense  of  Lehman  and  Sheffe.  The  results  of  the  power  studies 
are  summarized  in  column  2  of  Table  3.2.  In  computing  the  entries  in 
column  2,  the  quantiles  of  R^q  given  in  Table  3.1  are  used. 


3.2  Comparison  with  Other  Tests 

Of  the  several  goodness  of  fit  tests  for  the  exponential  distri¬ 
bution  which  are  available  in  the  literature  (Stephens,  1978),  the  one 
which  appears  to  be  the  most  recent  and  which  enjoys  the  advantages  of 
good  power,  good  asymptotic  relative  efficiency,  robustness  to  measure¬ 
ment  error,  etc.,  is  the  one  based  on  the  Gini  statistic  (Gail  and 

Gastwirth,  1978),  where 
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TABLE  3 

.2 

— - - —  - 

POWER  COMPART SONS  FOR  A  TEST  FOR 

EXPONENTIALITY  BASED  ON  THE  i 

REDUNDANCY  AND  THE 

GINI  STATISTIC  (based 

on  1 . GOO 

samples 

of  size  n=20,  using  a  significance  level  of  cv= 

0.05) 

R20 

G20 

Alternat ive 

1-Sided 

2-Sided 

] -Sided 

2-Sided 

Test 

Test 

Test 

Test 

Weibull  shape  =0.8 

0.346 

t 

0.336 

-i- 

Weibull  shape  =  1.5 

0.643 

t 

0.643 

+ 

Uniform  on  (0,2) 

0.826 

t 

0.829 

t 

2 

Pareto  DF  1  -  l/4x 

0.830 

T 

0.835 

t 

Gamma  shape  2 . 0 

0.631 

t 

0.622 

+ 

Lognormal*  0=0.6,  U=1 

0.873 

t 

0.873 

+ 

Lognormal*  0=1.0,  U=1 

0.144 

0.154 

0.117 

0.126 

Lognormal*  0=1.4,  p=l 

0.629 

+ 

0.620 

t 

*f(x)  =- — — —  exp(-(log  x  -  M)2  /  o2  , 
Ox/ 2TT 

,  0  <  x  < 

°o  t 

+A  two-sided  test  is  inappropriate 

for  this 

case . 

J.  i(n_i)(x(i+i) 'xu)) 

Gn  "  - - T -  ’ 

(n-1)  l  X 

i=l  v  ' 

X.,,  <  ...  <  X.  „  are  the  order  statistics  of  X,  ,...,X 
(1)  (n,  1  n 

Thus  it  appears  that  it  is  appropriate  for  us  to  compare  the  test 
for  exponentiality  based  on  R^  with  the  one  based  on  0^  with  respect 

to  both  the  power  and  the  asymptotic  relative  efficiency.  Another  rea¬ 
son  for  choosing  G  as  a  basis  for  comparison  is  that  Chandra  and 

Singpurwalla  (1980)  have  shown  that  a  test  for  exponentiality  based  on 

G  is  identical  to  the  one  based  on  the  "total  time  on  test"  of  Barlow 
n 


I 
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and  Doksum  (1972).  Since  this  latter  test  is  known  to  be  asymptotically 
minimax  against  a  large  class  of  alternatives  defined  by  the  Kolmogorov 
distance,  so  is  the  test  based  on  G 

n 

In  column  3  of  Table  3.2  we  give  the  powers  of  the  tests  for  ex- 
ponentiality  based  on  against  the  several  alternatives  mentioned 

before.  The  entries  in  column  3  of  Table  3.2  have  been  taken  from  Table 
1  of  Gail  and  Gastwirth  (1978)  for  all  the  alternatives  considered  ex¬ 
cept  the  lognormal — this  alternative  was  not  considered  by  the  above 
referenced  authors.  The  powers  of  against  the  lognormal  alter¬ 

natives  were  obtained  by  us  with  a  Monte  Carlo  simulation  of  1000 
replications. 

From  a  comparison  of  the  entries  in  columns  2  ar.d  3  of  Table  3.2, 
we  see  that  the  two  tests  for  exponentiality  based  on  R^g  or  G^g 

perform  equally  well  or  poorly,  depending  on  the  alternative  considered. 
For  the  lognormal  alternative  with  0=1  and  p  =  1  ,  the  test  based 
on  f^g  appears  to  have  a  slight  edge  over  the  one  based  or.  G2Q  . 

Both  these  tests  have  low  power. 

A  reason  why  the  tests  for  exponentiality  based  on  the  sample 
redundancy  and  the  Gini  index  have  low  power  for  the  lognormal  alterna¬ 
tive  with  p  =  1  and  O  *  1  ,  is  that  this  distribution  is  neither 
star  nor  anti-star  ordered  with  respect  to  the  exponential ,  whereas  the 
other  alternatives  are.  Furthermore,  Chandra  and  Singpurwalla  (1980) 
also  show  that  when  a  distribution  F  is  star  (anti-star)  ordered  with 
respect  to  the  exponential  distribution  G  ,  then  G  ,  the  Gini  index 

r 

of  F  ,  is  less  (greater)  than  G  ,  the  Gini  index  of  G  .  Thus,  it  is 
not  possible  to  claim  unbiasedness  of  either  R^g  or  G„g  for  the  log¬ 
normal  alternative  in  question,  and  our  conclusion  is  that  neither  the 
test  based  on  the  Gini  index  nor  the  test  based  on  the  redundancy  should 
be  used  under  the  circumstances  described  above. 
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R^O)  “  ♦  where  i^(x)  =  d  log  l'(x)/dx  .  Hence  - 

^(l)(2)_i  =  -0.355066 8  ,  and  from  (4.2)  we  get  the  A.R.E.  as  0.674. 

For  the  Weibull  alternatives  F(x,B)  *  y  (x/A)’  ^expt-Cx/A)^]  , 
the  variance  of  /nf?  is  V  ■  0.60793  ,  again  independent  of  A  ,  and 

RF(6)  "  l4  +  i)  *  l08l’(1  +  i)  ’  Hence  RF(l)  =  ~^2>  -  ^(l)(2)  +  !K2)  - 
-0.6449332  ,  and  from  (4.2)  we  get  the  A.R.E.  as  0.872. 
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